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Abstract 

For 1 < a < 2 we derive the asymptotic distribution of the total length 
of external branches of a Beta(2— a, a)-coalescent as the number n of leaves 
becomes large. It turns out the fluctuations of the external branch length 
follow those of Tn~ a over the entire parameter regime, where r n denotes 
the random number of coalescences that bring the n lineages down to one. 
This is in contrast to the fluctuation behavior of the total branch length, 
which exhibits a transition at ao = (1 + y/E)/2 ([IS]). 
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1 Introduction and main results 



The family of Beta coalescents belongs to the so-called A-coalescents introduced 
by Pitman [19] and Sagitov [20], see the survey of N. Berestycki [3]- These 
are characterized by a probability measure A(c?p) on [0,1]. For A — 5q one 
recovers the classical Kingman coalescent. For A having no mass in they 
can be thought as modelling a random gene tree within a species in which 
single reproduction events affect a non- vanishing fraction of the population (see 
[3 [IT] [22] for applications to certain maritime species). The basic ingredient 
for the coalescent dynamics is a Poisson process on M x [0, 1] with intensity 
measure dtv(dp)dt — dtp~ 2 A(dp). With this Poisson process as a random 
input, the (A, n)-coalescent arises as follows: Initially there are n lineages (or 
"particles"), and whenever a Poisson point (t,p) arrives, each of the lineages 
that exist at time t independently takes part in the corresponding coalescence 
event with probability p. By assumption, the pair coalescence rate equals 1, so 
the events that are relevant for coalescences within the sample arrive at finite 
rate. (More formally, this can be viewed as a partition-valued Markov process, 
see below.) 

For the choice v{dp) = const (p/(l — p)) a , < a < 2, the probability measure 
A is the Beta(2 — a, a)-distribution. For a — 1 one obtains the Bolthausen- 
Sznitman coalescent j6| and in the limit a — > 2 one retrieves Kingman's coa- 
lescent. Beta coalescents play a prominent role because of their intimate con- 
nections to a-stable continuum branching processes (see [3]) and also hgure as 
prominent alternatives to Kingman's coalescent from a statistical point of view, 
see 01 |2I]. 

Recently there has been considerable progress in investigating the asymp- 
totic distribution (as the sample size n — > oo) of interesting functionals of Beta 
coalescents, such as the total tree length L n or the total length £ n of all external 
branches, i.e. those branches which end in a leaf. These quantities are also of 
interest in population genetics, since in the infinite sites model L n figures as 
the (random) Poisson parameter of the total number of segregating sites in the 
sample, whereas £ n is the (random) Poisson parameter of the total number of 
mutations carried by single individuals. 

In this paper we will investigate the asymptotic distribution of the suitably 
normalized total external branch length l n in a (Beta(2 — a, a), n)-coalescent, 
with 1 < a < 2. In order to state our main result and to put it into context 
with previous research, we give a few more formal definitions. 

With a labelling of the initial particles by the numbers 1, . . . , n, the merging 
process is described by a sequence IIo, . . . , H Tn of partitions of {1, ... ,n}. Here 
Ilk consists of the classes of a (random) equivalence relation ~& on {1, . . . , n}, 
where i j states that i,j £ {1, . . . , n} have coalesced into one particle after 
k merging events. In particular, IIo = {{1}, ■ • ■ , { n }} and Tl Tn — {{1, . . . , n}}, 
where r„ denotes the (random) total number of merging events. 

The process proceeds in continuous time. At times = To < Ti < • • • < T Tn 
particles merge, and n = Xq > X\ > • • ■ > X Tn = 1 are the corresponding 
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numbers of particles, thus 

Xu = #Ilfe . 

Note that the times Tk and numbers Xk depend on n (which we suppress in the 
notation). For convenience we put Xk = 1 for k > r n . The total branch length 
of the tree is given by 



L n — Xk(Tk+\ — Tk) 



fe=0 

whereas the total length £ n of all external branches can be written as 

T„-l 

l n = £ Y k {T k+l -T k ) , (1) 

fe=0 

with 

Y k = #{i < n : {1} e Ilk} , 

which is the number of external branches, still present up to time Tk- Note that 
Y Tn = 0. For definiteness let Yk = for k > t„. 

The asymptotic distribution of L n and l n has been studied in various publi- 
cations. Drmota et al [TU] and Iksanov and Mohle [13, studied the total length 
of the Bolthausen-Sznitman coalescent. Mohle's investigation [TS] covers the 
case < a < 1 . In that case there is no substantial difference in the asymptotic 
behavior of L n and £ n , more precisely 

Ln d r, , d t-t 

— -> S and — S 

n n 



with 
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exp(-^ t ) dt 



where (£,t)t>o denotes a certain driftless subordinator, depending on a. 

In case 1 < a < 2 the situation is notably different. For this case Berestycki 
et al [HQ] (see the Theorems 9 and 1.9 therein) give the following results: 

L„ a(a-l)T(a) l n 

and — 2 — a 



n 2 ~ a 2 - a L n 

in probability, see also Dhersin and Yuan [9]. We shall see that the difference 
to the case < a < 1 become even more visible on the level of fluctuations. 

Let <^ denote a real- valued stable random variable with index 1 < a < 2, 
which is normalized by the properties 

E(?) = , P(? > x) = o(x- a ) , Pfc < -x) ~ x- a (2) 

for x — > oo. Thus <; is maximally skewed among the stable distributions of 
index a. 
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Also let 



d = a(a - l)r(a) , c 2 = A- ^ ^ 

1 (2 — a) a 

Theorem 1. for ffte B eta- coales cent with 1 < a < 2 



in - c\n 



2-a 



4 c 2 <; . (3) 



-1— a 



The corresponding statement for the Kingman coalescent can be found in Janson 
and Kersting [15] : up to a logarithmic correction in scale the analogue of (|3|) 
also holds for the case a = 2, then with a normal limiting distribution. 

It is of interest to contrast Theorem 1 with the corresponding statement for 
the total length L n , obtained in Kersting [15] . For convenience we reproduce it 
here. 

Theorem 2. Let d = i = 1, 2. T/iera /or 1 < a < 2: 
(i) Ifl<a< |(1 + a/5) (tfius 1 + a — a 2 > 0), then 

n£ +1 - a * (l + a-a 2 )s ' 



(ii) // a = |(1 + V5), then 

(lognjo 



(hi) If |(1 + < a < 2, then 

L n - c' x n 2 - a -4 T) , 

where n is a non- degenerate random variable. 

The contrast between the two theorems is striking. The different regimes in 
Theorem [5] and the transition at the golden ratio otQ = h(l + Vo) is no longer 
present in Theorem [T] Instead we observe that \l n — c\n 2 ~ a \ converges to 
or oo, depending on whether a > a$ or a < olq. This can be understood as 
follows: There are two sources of randomness at work in L n . On the one hand 
fluctuations arise from the randomness within the 'topology' of the tree, more 
precisely from the random variables Xq > X\ > • • • > X Tn . These fluctuations 
are of order n^ +1 ~ a . On the other hand also the waiting times Tk+\ — Tfe 
contribute to the random fluctuations. Most substantial are those fluctuations 
arising close to the root of the tree, that is the times Tk+i — Tfe with k close 
to r n . These fluctuations are of order 1. Now either of these fluctuations may 
dominate the other, depending on the sign of ^ + 1 — a. This is reflected in 
Theorem 2. 
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On the other hand the just mentioned fluctuations of order one do not show 
up asymptotically in l n . This is simply due to the fact that branches close to 
the root of the tree typically are internal branches. 

The fact that for a > ao = 1.61 the external length £ n is decoupled from 
the total length L n is illustrated by the plots in Figure [TJ 




Figure 1: External versus total length. Each plot is based on 1000 coalescent 
realisations with n — 1000. 



As a byproduct, we prove the following result on the decrease of the numbers 
Xk of branches and of external branches. 



Theorem 3. For 1 < a < 2 asra— s-oo 



max 

i<j'<t„ 



— i — op(l) and max T " 3 

t„ i<j<t„ n 



This is illustrated in Figure [2] 



3 \ a 



Mi) 



n = 100 



X 



Y 



n = 1000 



30 40 SO 




n = 10000 



X // 



Y 



Figure 2: Simulation of X = (X Tn -j) and Y — (Y Tn -j) for a = 1.5 and n = 100, 
1000 and 10000. The dashed line shows the curve n(j/T n ) a . 
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2 Heuristics and outline 

The proof of Theorem [T] is given in the next section. It reveals an unexpected 
link between l n and the total number of mergers r„ . In this section we explain 
the heuristics; this may also serve as a guide through the next section. 
For m > 2, the rate at which a coalescence happens within m lineages is denoted 
by A m . We thus have 

T n -1 

L„^ ]T X fe WVAx fc (4) 



fc=0 



and 



in =J2Y k W k /X Xk (5) 

fc=0 

where Wo, W\, . ■ ■ are i.i.d. standard exponential distributed random variables. 
From Lemma 2.2 in [5] we have for m — > oo 

A m = — l --m a + Oim - 1 ) . (6) 
ai [a) 

Combining Q and ([6j suggests, informally written, 

r„-l 

L n « oT(a) J] Xfc 1 -". 

fe=0 

On the other hand, a coupling of (Xk) with a renewal process shows 

X fe « 7 -(r„-fc), (7) 
where 7 = — (see Lemma[5]for the exact statement). This might suggest that 

L n ^aY{a)^r 2 n - a . (8) 
l — a 

Before discussing the (in-)validity of this asymptotics, let us turn to the ex- 
ternal length. Since, given X = (Xk), the random variables Y k result from a 
hypergeometric sampling (see equation (fT5|) below), we obtain (see (|20p) 



E[r fe 1 x] = x k Y[(i-i/Xi) 



Replacing in ([5]) by this conditional expectation and again using (JS]) , we are 
led to 

t»-i k 

£ n « oT(a) £ ^-"fla - W). (9) 

fe=0 i=l 
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Plugging in (O, using the asymptotics J\i=i (l — 7^7 ) ~ ( T « — ^) ^ 7 an d the 
cancellation of (r„ — fc) 1 / 7 against (r„ — /c) 1- ", we arrive at 

tn « ar(a) 7 1 - tt r2- tt (10) 

(see Lemma [5] for a precise statement). The asymptotics of r„ is (see Lemma H} 

n 1 /„ 
r„ « — h n ' const >;. 

7 

Using a Taylor expansion one hence obtains (see (|lip for the exact statement) 

2-a 

+ n 1 - a+1/a const' 

This shows that (as n — > 00) the fluctuations of t^ _q decrease with n for 
a > «o and increase for a < oq. Since (as discussed above) the fluctuations of 
L n are always at least of order one, the approximation ([SJ cannot capture the 
fluctuations. In contrast, the approximation (|10p turns out to be sufficiently fine 
also on the level of fluctuations. The reason for this is the asymptotic flatness of 

the function (x k ) i-> J2k=o x k~ a Ili=iO- ~ V x i) in tne P oint fafe) = (7( T - k)), 
as revealed in the proof of Lemma [7] 




3 Proofs 

Theorem [T] is a consequence of the following two lemmas. 

The first lemma on the number of mergers has been independently obtained 
by Delmas et al jS], Gnedin and Yakubovich [12] and Iksanov and Mdhle [14]. 
(Note that they use a different normalization of the limit law.) We put 

1 

a — 1 

Lemma 4. Let 1 < a < 2. TTiera /or n — > 00 



n 1 /" 7 s+ 1 r(2-a)s ' 

with q described in Section [T] In short the proof of this lemma goes along 
the following lines: The negative increments X^-x — X^ can be asymptotically 
replaced by i.i.d. positive random variables Vk, k > 1, with expectation 7 and 
tail behavior P(V4 > v) ~ r(2^o) w From the theory of stable laws for k —> 00 

X - X fc - 7 fc Vi + • • • + V k - jk a -? 



fcl/a fc l/a r(2-a) 1 /" 

The choice k = r n gives 



7t™ _d 



r, 



V« r(2-a) 1 /« 
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and thus r„ ~ n/j. This entails the lemma. For details we refer to [51 [T2l fTl] . 
The asymptotic expansion in the next lemma discloses the link between t n and 

Lemma 5. Let 1 < a < 2 and s > 0. TAen /or n — >• oo 

4 = ar(a)(a - l)^ 1 ^-" + P (n^- Q+£ + . 

Before proving this key lemma we show that Theorem [T] is a direct consequence 
of the two preceding lemmas: 

Proof of Theorem [7] From Lemma U and a Taylor expansion we obtain 

= (^f~" + (2 - c)(^ a (r n - 2) + P (n-*S) . (11) 

Using 1 < a < 2 and Lemma [5] 

£ n = a(a - l)r(a)n 2 -" + o(2 - a)r(a)n 1_Q (r„ - -) + P (n* +1 - Q - e ) 

for some e > 0. This implies the theorem. □ 
Now we prepare the proof of Lemma [5] by deriving two other lemmas. 
Lemma 6. For any e > 

max j-^- e \X Tn -j - 7 j| = O p (1) , 

as n — y oo . 

Proof. First note that due to monotonicity of for any r/ E N 

max i-°- £ |X Tn _j- - 7j| < max jT^IX^-j - Til + - X v + 777 . 

It is well-known that Xq — X n is stochastically bounded uniformly in n, see e.g. 
formula (7) in |16j . Therefore it is sufficient to prove the claim for the quantity 

max j-«- e \X Tn -j — Til , 

l<J<r n —r} n 

where rj n is any sequence of N-valucd random variables, which is stochastically 
bounded. 

This observation allows us to use a coupling device: Let 2 < S\ < S 2 < • • • 
denote the points of the coalescent's point process /1 downwards from 00, as 
introduced in section 3 of [TB]. There it is shown that one can couple /j, and X 
on a common probability space in such a way that X Tn -j = Sj for all j < r„— r) n , 
where the random variables r\ n are stochastically bounded uniformly in n, see 
Lemma 6 in [16] . Therefore it is sufficient to estimate 

max j-"- e \Sj - Til , 

l<j<a(n) 
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with 

cr(n) = max{j > : Sj < n} , 

where we put So = 1. 
We shall consider 

M r = max j~"~ e \S» — 77I 

( t(2-1)< J < CT (2") J 

for r > 2. For this purpose we use the coupling of [i to a stationary point process 
with points 2 < i? x < i? 2 < • ■ •, as described in section 4 of [16] , Put 

p(n) = max{j > : Rj < n} , 

also with the convention i?o = 1. Since — Rj are i.i.d. random variables 

for j i > 1 with mean 7, we have 

~ 7 _1 n a.s. (12) 

Moreover, since i?j = i?i + V\ + ■ ■ ■ + Vj, where V\, V% ■ ■ . are i.i.d. random 
variables with distribution given in formula (4) in [16 , it follows from Lemma 
9 in [IS] that 

max jS-'lRi - 7 il = 0(2^) a.s. (13) 

p(2'— 1 )<J<p(2'') 

Now equations (19) and (24) in [16] say that for 

iV r = .7(2^) - a^- 1 ) , iV; = p(2 r ) - p(2 r ~ 1 ) 
with r > 2 we have 

|JVr - K\ < D r 

where the random variables 

0<i<N r /\N' r 1 v ' 

fulfil 

P(A- ><) < ci 1 "", i > 0, 
for some c > 0. Hence it follows by the Borel-Cantelli lemma that 

D r = 0(r 7+e ) a.s. , (14) 

and therefore from |er(2 r ) - p{2 r )\ < D r H h D x + 1 

H2 r ) - p(2 r )| = 0(^ +1+E ) a.s. (15) 
for any e > 0. Equation ([12]) implies 

a(2 r ) - 7" 1 2 r a.s. (16) 
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for r — > oo. By the triangle inequality (and reparametrising by j — cr(2 r ) — i)) 
a(2 r - 1 )^ +£ M r < max \Si - 

~ <7(2*-l)<i<<7(2<-) 

•'■ max \S„or-\_j — R 



0<i<N r 



a(2-)-i ~ ■n-(p(2'-)-i)Vp(2'-l) | 



+ max liJ,- — 7j| 

p(2-i)< 3 <K2'V 

+ 7 o max^ |(p(2'') - i) V p^ 1 ) - (a(2'') - t)| . 

In case N' r < N r the first maximum on the right-hand side is not attained for 
i > N' r , because then \S a (2>-)-i ~ R( P (2n-i)v P (2— 1 )\ = S a ^)-i - R p (2— 1 ) is 
decreasing for i > N' r . Also the third maximum is attained either for i = or 
i = N r . This implies 

(T (2 r - 1 )^ +e M r < max \S a{2 r)-i - Rrfr)-t\ 

+ 7 | ( r(2' r )-p(2 r )|+ 7 |a(2 f - 1 )-p(2 f - 1 )| 
+ max — 71 1 

P(2- I )<j< / P(2'-) 

From (fH|) and (fT5|) for e sufficiently small 

M r = ( j(2 r - 1 )-i- e max liL - 7 j| + 0(2-^r 1+7+£ ) 
< 2 max j--- e |i? 7 - 7j| + 0(2- £r ) 



Using (|T3j) we infer 

M r = 0(2- £r / 2 ) a.s. 
Now for n GN, choosing s£N such that 2 S ~ 1 < n < 2 s , since a(n) < n 

s s 

max r-- 8 |$-7i| < E M - = °(E 2_W2 ) = °(!) a ' s ' 

l<?<er(n) * — ' L — ' 
— /_ V ' r=l r=l 

This gives the claim. □ 

Now observe the following: Not only X = (Xo, . . . , X Tn ) is a Markov chain, 
but given X also the sequence Y — (Y 0l . . . , Y Tn ). More precisely let U k be the 
number of particles collapsing to one particle in the fc-th merging event, thus 

X k = AVi -U k + 1 . (17) 

For k > r n we have U k = 1, since by definition 1 = X Tn = X Tn+1 = ■ ■ ■ . The 
numbers of external branches, n = Yq > • • • > Y Tn = 0, follow the recursion 

Y k = Y k _ x - H k (18) 
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with 

(H k | X,Y ,...,Y k _ 1 ) = Kyp(X k _ 1 ,Y k _ 1 ,U k ) , 

where Hyp (TV, M, v) denotes the hypergeometric distribution with parameters 
N, M, v. (These formulas also hold for k > r„.) Thus 

E[Y k \X,Y ,..., F fc _i] = r fe -i - U k ^- = ^zly^ 

-*fe— 1 ^-fe-i 

and 

s^i.^Iepv,^-"^^^-^). (19) 

Afc AfcAfc_i V Aft/ 

Iterating this formula and observing that E[Yo | X] = Yq = Xq we obtain 

E[r fe | x] = x k n* , (20) 

where 

n):= n f 1 -^)' o<i<*. 



fc — ' 



in particular EI* = 1. 
Next we study the auxiliary quantity 11^. 

Lemma 7. Let e > 0. Then for < j < k < r„ as rt — » oo 



fe 

/ T_ — k: \ — / i 
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r ._-*)*( 1+ E^a^ + o,(( T .-»,*--)) (21 > 

and 



n, fe =(^ r |)"(l + Op((r n -fc)-- 1+ 0) , (22) 

where the Op(-) terms hold uniformly for all < j < k < r„. 

Proof. Using a Taylor expansion of log(l + h) we obtain that on the event 
{k < T n } 

k , k 



■9->(-.?z + °(.?z 



with the error term uniformly bounded in n and < j < < r n . In view 
of a Taylor expansion together with Lemma |H] and the fact that J^tn^r ~ = 
log - + O(i) we obtain 

k 



O 



^ : V, J^Cyfo-i) 7 2 (r„- i ) 2 ' ""V (r„-i) 3 

1, T n -j m-^ Xi--f(r n -i) , , ,= 2 + s \ 
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where the Op( ) holds uniformly for all < j < k < r n . Also from Lemma [5] 

i=j+l 1 i=j+l y ' 

therefore for < j < k < r n , since 1 < a < 2, 

\T n -jJ \ .frf . T{T n -iy ') 
This shows (j2Tj) . Finally from Lemma [6j since a > 1, 

E X< f ; 7 lV = ° f ( S K-^- 2+E ) =Op((r B -fe)^) , 
i=i+i ^ n ' i=i+i 

hence also (|2"21 follows. □ 

Proof of Lemma\^ We shall approximate step by step. First let 

fe=0 fc fc=0 

To obtain an approximation to this expression we use the following expansion, 
following from Lemma |5] Uniformly for k < r„ 

x l~ a = (7(t« ~ fc)) 1_Q + (1 - a)(7(r n - fc))-"(X fc - 7 (r n - fc)) 

+ P ((r„ - fc)""- 1 ^ - 7(t„ - fc)) 2 ) 

and thus 

Xl- = (7(r„ ~ ^""(l - X ;^ (7 ! fc) fc) + Op((t„ - . 
Together with Lemma [7] we obtain (taking the first summand n 1_Q apart) 

l n = + ^O 1 - E (l + E Xl_ 7(T " " 



fc=i 



Xfe - 7(T "- fc) +Op(K- fc) i- 2 -)) 



7 2 (t„ - fc) 
Now 

such that (surprisingly) matters simplify to 

in = n 1 -" + ( 7 T n) 1— a ( T n + O p (t#- 1+£ )) 
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or, using Lemma HI 

I» = l X - a rl- a + P (n^- a +c) . (23) 

Next we consider 

i' = V — 

Putting Gfe = Yfe — E[Yfc | X], we obtain, since Go = 0, 

, _ V — 



k=i A fc 



Now from (JTS) and (|T7)) 



F/b = n-i^ flfc, withff fe =H k -U k - 



Xk-i x k 
and thus from (fH?)) 

Gfe = Gfc_i— i?fe 

-*fe-i 

or 

Gfe Gfe_i i?fc 



Iterating this relation we obtain because of Go = 

k 



thus 



[ — [' 



C *-^SM' (24) 



T ^ n* A a T ^H4 T ^ nf 



fe=l fc i=l 10 i=l 4 k=i k 



Now note that given X, Yq, . . . , Yi_i the random variable i/i has a hyperge- 
ometric distribution centered at expectation. In fact (Hi) forms a sequence 
of martingale differences w.r. to the filtration generated by (X,Yq,Yx, . . . ,Yi), 
i > 0, therefore H-y,H<i,... are uncorrelated given X . Also from the formula for 
the variance of a hypergeometric distribution 

E[HS\X i Y ,...,Yi- 1 ]<U i ?t±. (25) 

X-i 

This entails 

i—1 1 k—i ft 



E ^^HE ^pr 
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Using a Taylor expansion similar as in the proof of Lemma [7J and applying 
Lemma [6] we see that A^ 1 is of order (r„ — i)^ 1 uniformly in < i < t„. This 
together with Lemma [7] implies 

e[(i; - In)" I X] 

T~n i + 1 \ 7 1 / \ / t„ A: \ ~ 1 



(r„ - z) 2 V f 1 -; \T n -iJ (r„ - 
Since i = a — 1, this boils down to 



E[([;-[„) 2 |X]=Op(rr«^C/ l (r 1 



Again from Lemma [5] Xj_i is of order r n — i uniformly in < i < t„. Also 
Ui = - X; + 1 < 2(X,-_i - Xj), therefore 



E[(£ - \ n f | A] = P {rt a E Xi-i&i-i X i 

1=1 

Moreover 

tt 'XT* 2 - 

Since Ao = n, we end up with 

E[(['„-[„) 2 |A]=0 P (n 3 - 2 "). 

Chebychev's inequality and dominated convergens now imply for any e > 

P(\l' n - l n \ > n*-«+ £ ) = E[P(|£ - f„| > | A] 

< E[l A (n- 3 + 2 - 2£ E[(C - U 2 | A])] = 

thus \' n - („ = P {ni- a+£ ). Combined with ([23]) we obtain 

In = 7 1_Q r 2 - a + P (ni- a+e + n^ a+£ ) (26) 

for all £ > 0. 

Next we prepare to incorporate the waiting times between mergers into our 
expression. Let W\ , Wi , . . . be independent exponential random variables with 
mean 1, also independent from (X,Y). Let 
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Then 

fe=0 k fe=0 k 

and therefore 

E[(K - O 2 1 *] < E = EV 2Q n§ . 

fe=0 k k=0 

From Lemmas [6l [7] and a < 2 it follows that 

T„-l 

E[(£ - [JJ 2 | X] = P (r'- a E ( T « - fc ) 1_a ) = Op(n 3 ~ 2Q ) . 

k=0 

As above we may conclude [" — \! n — Op(ni~ a+E ) and consequently from (f2f))) 

£ = 7 1_Q ^ _Q + P (n*- Q+£ + ni- Q+e ) . (27) 

In a last step we add the coalescence rate Aj^ into our formulas. From Lemmas 
[Hand [7] 



E 



—^Wk \ X = — 

A, A 



I * = = P (r^ a (r n - fc)- 1 ) , (28) 

k 



where the Op(-) holds uniformly for all < k < T n . Therefore, taking the sum 
in (|28p and using Markov's inequality, 

r„-l y r„-l 

E ~V^+l Wk = °p{ T n~ a E ( T « - *)"*) = l0 S T ™) ■ 

fe=0 k k=0 

Combined with © and (|27l) this gives 

r„-l y 

Y ^W k = ar(a) 7 1 - tt r 2 - Q + P {n^- a+e + ? ^- Q+e ) . 
^ Xx * 

Now the left-hand quantity is equal to t n in distribution. This proves Lemma [SJ 

□ 

Proof of Theorem [3l The first statement is a direct consequence of Lemma [6] 
with e = 1 — 1 /a and Lemma 0] For the proof of the second one note that 
in view of monotonicity of (Yfe) it is sufficient to show that for < c < 1 and 

h n ^ CT n 



U (l-c) Q 
n 



in probability. From (|2U1) , (|2"21 and the first statement of the theorem 

E ^!^l_^, n ^ (1 _ c) . (1 _ cr i 
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in probability. Thus it remains to prove that G kn = Y krl — E[Yfc„ | X] is of order 
op(n). Now from (f24|). (l25j) and the explanation given there, 

E[G 2 k | x] < (x k n*r £ TT^i^n- 1 . 

As in the proof of Lemma [5] (see the arguments following the inequality (|25|) ) 
we mutually replace Xi and r„ — i as well as LTq and (r n — i) 1 / 7 jrlJ 1 to obtain 
uniformly in fc 

and uniformly in fc 

E[G 2 fc | X] = Opf ^":^ 2 " ^") = Op(t„) = P (n) . 
This implies that Gk„ is of order Op(y/n), which gives our claim. □ 
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